Overview
Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 45466 |
| Missing cells | 506 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 27 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 136.0 B |
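The rounded percentages and the average record size in this table can be reproduced from the raw counts. The following is a minimal sketch, assuming that the missing-cell share is taken over all rows × columns cells, that the duplicate share is taken over all rows, and that 1 MiB is 2^20 bytes. The variable names are illustrative only.

```python
# Counts copied from the Dataset statistics table.
n_rows = 45466
n_cols = 17
missing_cells = 506
duplicate_rows = 27
total_mib = 5.9  # already rounded in the report, so derived values are approximate

# Assumption: share of missing cells over all rows x columns cells.
missing_pct = 100 * missing_cells / (n_rows * n_cols)
# Assumption: share of duplicate rows over all rows.
duplicate_pct = 100 * duplicate_rows / n_rows
# Assumption: 1 MiB = 2**20 bytes.
avg_record_bytes = total_mib * 2**20 / n_rows

print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.0f} B")
```

Both shares are below 0.1% before rounding, which is why the table shows 0.1% for each.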
Variable types
| Categorical | 2 |
|---|---|
| Text | 9 |
| Unsupported | 1 |
| Numeric | 4 |
| Boolean | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 27 (0.1%) duplicate rows | Duplicates |
| revenue is highly overall correlated with vote_count | High correlation |
| vote_count is highly overall correlated with revenue | High correlation |
| adult is highly imbalanced (99.8%) | Imbalance |
| status is highly imbalanced (97.0%) | Imbalance |
| video is highly imbalanced (97.9%) | Imbalance |
| popularity is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| revenue has 38052 (83.7%) zeros | Zeros |
| runtime has 1558 (3.4%) zeros | Zeros |
| vote_average has 2998 (6.6%) zeros | Zeros |
| vote_count has 2899 (6.4%) zeros | Zeros |
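The three imbalance percentages are consistent with a normalized-entropy score, 1 − H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of distinct classes. The following is a minimal sketch under that assumption. The definition is inferred from the numbers, not stated in this report, and `imbalance_score` is a hypothetical helper. The class counts come from the Common Values tables of `adult`, `status`, and `video`.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits over the observed class frequencies.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


class_counts = {
    "adult": [45454, 9, 1, 1, 1],
    "status": [45014, 230, 98, 20, 15, 2],  # missing values excluded
    "video": [45367, 93],                   # missing values excluded
}

scores = {name: imbalance_score(c) for name, c in class_counts.items()}
for name, score in scores.items():
    print(f"{name}: {100 * score:.1f}%")
```

A score near 100% means that almost all rows fall into a single class, as is the case for all three columns here.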
Reproduction
| Analysis started | 2025-03-05 09:18:47.426243 |
|---|---|
| Analysis finished | 2025-03-05 09:18:50.482955 |
| Duration | 3.06 seconds |
| Software version | ydata-profiling v4.12.2 |
Variables
adult
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 126 |
|---|---|
| Median length | 5 |
| Mean length | 5.0050807 |
| Min length | 4 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | False |
|---|---|
| 2nd row | False |
| 3rd row | False |
| 4th row | False |
| 5th row | False |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| False | 45454 | > 99.9% |
| True | 9 | < 0.1% |
| - Written by Ørnås | 1 | < 0.1% |
| Rune Balot goes to a casino connected to the October corporation to try to wrap up her case once and for all. | 1 | < 0.1% |
| Avalanche Sharks tells the story of a bikini contest that turns into a horrifying affair when it is hit by a shark avalanche. | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| false | 45454 | |
| true | 9 | < 0.1% |
| to | 4 | < 0.1% |
| a | 4 | < 0.1% |
| the | 2 | < 0.1% |
| avalanche | 2 | < 0.1% |
| by | 2 | < 0.1% |
| when | 1 | < 0.1% |
| contest | 1 | < 0.1% |
| hit | 1 | < 0.1% |
| Other values (32) | 32 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 45479 | |
| a | 45475 | |
| s | 45465 | |
| l | 45461 | |
| F | 45454 | |
| (space) | 49 | < 0.1% |
| r | 25 | < 0.1% |
| t | 23 | < 0.1% |
| o | 19 | < 0.1% |
| n | 17 | < 0.1% |
| Other values (24) | 94 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 227561 |
budget
Text
| Distinct | 1226 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 1 |
| Mean length | 2.2153917 |
| Min length | 1 |
Unique
| Unique | 839 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 30000000 |
|---|---|
| 2nd row | 65000000 |
| 3rd row | 0 |
| 4th row | 16000000 |
| 5th row | 0 |
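`budget` is profiled as Text even though the sampled rows are all integers. The maximum length of 32 suggests that a few cells hold non-numeric strings. The following is a minimal sketch of coercing such a column, assuming that unparseable cells should become `None`. The helper `to_int_or_none` and the bad example value are hypothetical and do not come from the dataset.

```python
def to_int_or_none(cell):
    """Convert a cell to int, or return None when it is not a plain integer."""
    try:
        return int(str(cell).strip())
    except ValueError:
        return None


# The first three values come from the Sample table; the last one is a
# hypothetical malformed cell used only for illustration.
cells = ["30000000", "65000000", "0", "not-a-number"]
parsed = [to_int_or_none(c) for c in cells]
print(parsed)
```

Counting the `None` results after such a pass gives a quick estimate of how many cells keep the column from being typed as numeric.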
| Value | Count | Frequency (%) |
| 0 | 36573 | |
| 5000000 | 286 | 0.6% |
| 10000000 | 259 | 0.6% |
| 20000000 | 243 | 0.5% |
| 2000000 | 242 | 0.5% |
| 15000000 | 226 | 0.5% |
| 3000000 | 223 | 0.5% |
| 25000000 | 206 | 0.5% |
| 1000000 | 197 | 0.4% |
| 30000000 | 190 | 0.4% |
| Other values (1216) | 6821 | 15.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 84525 | |
| 1 | 3222 | 3.2% |
| 5 | 3201 | 3.2% |
| 2 | 2555 | 2.5% |
| 3 | 1792 | 1.8% |
| 4 | 1325 | 1.3% |
| 6 | 1147 | 1.1% |
| 7 | 1119 | 1.1% |
| 8 | 1102 | 1.1% |
| 9 | 660 | 0.7% |
| Other values (39) | 77 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100725 |
genres
Text
| Distinct | 4069 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 264 |
|---|---|
| Median length | 225 |
| Mean length | 62.822131 |
| Min length | 2 |
Unique
| Unique | 2365 |
|---|---|
| Unique (%) | 5.2% |
Sample
| 1st row | [{'id': 16, 'name': 'Animation'}, {'id': 35, 'name': 'Comedy'}, {'id': 10751, 'name': 'Family'}] |
|---|---|
| 2nd row | [{'id': 12, 'name': 'Adventure'}, {'id': 14, 'name': 'Fantasy'}, {'id': 10751, 'name': 'Family'}] |
| 3rd row | [{'id': 10749, 'name': 'Romance'}, {'id': 35, 'name': 'Comedy'}] |
| 4th row | [{'id': 35, 'name': 'Comedy'}, {'id': 18, 'name': 'Drama'}, {'id': 10749, 'name': 'Romance'}] |
| 5th row | [{'id': 35, 'name': 'Comedy'}] |
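The sampled cells are Python-literal strings (single-quoted keys) rather than strict JSON, so a JSON parser would reject them. The following is a minimal sketch of turning one cell into a list of genre names, assuming every cell follows the shape shown in the sample. `parse_genres` is a hypothetical helper, not part of the report.

```python
import ast


def parse_genres(cell):
    """Parse a stringified list of {'id': ..., 'name': ...} dicts into names."""
    try:
        items = ast.literal_eval(cell)
    except (ValueError, SyntaxError):
        return []  # malformed cell: treat as having no genres
    if not isinstance(items, list):
        return []
    return [item["name"] for item in items if isinstance(item, dict) and "name" in item]


first_row = "[{'id': 16, 'name': 'Animation'}, {'id': 35, 'name': 'Comedy'}, {'id': 10751, 'name': 'Family'}]"
genres = parse_genres(first_row)
print(genres)
print(parse_genres("[]"))
```

`ast.literal_eval` only evaluates literals, so it is a safer choice than `eval` for untrusted cell contents.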
| Value | Count | Frequency (%) |
| id | 91106 | |
| name | 91106 | |
| drama | 20265 | 5.5% |
| 18 | 20265 | 5.5% |
| 35 | 13182 | 3.6% |
| comedy | 13182 | 3.6% |
| 53 | 7624 | 2.1% |
| thriller | 7624 | 2.1% |
| romance | 6735 | 1.8% |
| 10749 | 6735 | 1.8% |
| Other values (71) | 92873 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 546636 | |
| (space) | 325231 | 11.4% |
| : | 182212 | 6.4% |
| a | 152966 | 5.4% |
| e | 146936 | 5.1% |
| m | 144238 | 5.0% |
| , | 139188 | 4.9% |
| i | 130819 | 4.6% |
| n | 126822 | 4.4% |
| d | 107792 | 3.8% |
| Other values (46) | 853431 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2856271 |
id
Text
| Distinct | 45436 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 5.2514846 |
| Min length | 1 |
Unique
| Unique | 45407 |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | 862 |
|---|---|
| 2nd row | 8844 |
| 3rd row | 15602 |
| 4th row | 31357 |
| 5th row | 11862 |
| Value | Count | Frequency (%) |
| 141971 | 3 | < 0.1% |
| 159849 | 2 | < 0.1% |
| 168538 | 2 | < 0.1% |
| 298721 | 2 | < 0.1% |
| 265189 | 2 | < 0.1% |
| 5511 | 2 | < 0.1% |
| 97995 | 2 | < 0.1% |
| 99080 | 2 | < 0.1% |
| 23305 | 2 | < 0.1% |
| 119916 | 2 | < 0.1% |
| Other values (45426) | 45445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 32923 | |
| 2 | 28625 | |
| 3 | 26732 | |
| 4 | 24747 | |
| 5 | 21996 | |
| 6 | 21184 | |
| 7 | 20949 | |
| 8 | 20909 | |
| 9 | 20485 | |
| 0 | 20208 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 238764 |
imdb_id
Text
| Distinct | 45417 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 17 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9994719 |
| Min length | 1 |
Unique
| Unique | 45387 |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | tt0114709 |
|---|---|
| 2nd row | tt0113497 |
| 3rd row | tt0113228 |
| 4th row | tt0114885 |
| 5th row | tt0113041 |
| Value | Count | Frequency (%) |
| tt1180333 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
| tt0046468 | 2 | < 0.1% |
| tt1327820 | 2 | < 0.1% |
| tt2818654 | 2 | < 0.1% |
| tt0111613 | 2 | < 0.1% |
| tt1821641 | 2 | < 0.1% |
| tt0127834 | 2 | < 0.1% |
| tt0295682 | 2 | < 0.1% |
| tt0080000 | 2 | < 0.1% |
| Other values (45407) | 45427 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 90892 | |
| 0 | 69913 | |
| 1 | 37232 | |
| 2 | 31234 | 7.6% |
| 4 | 28498 | 7.0% |
| 3 | 28135 | 6.9% |
| 8 | 25445 | 6.2% |
| 6 | 25442 | 6.2% |
| 5 | 24253 | 5.9% |
| 7 | 24221 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 409017 |
original_language
Text
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.000154 |
| Min length | 2 |
Unique
| Unique | 20 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
| Value | Count | Frequency (%) |
| en | 32269 | |
| fr | 2438 | 5.4% |
| it | 1529 | 3.4% |
| ja | 1350 | 3.0% |
| de | 1080 | 2.4% |
| es | 994 | 2.2% |
| ru | 826 | 1.8% |
| hi | 508 | 1.1% |
| ko | 444 | 1.0% |
| zh | 409 | 0.9% |
| Other values (82) | 3608 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 34598 | |
| n | 32978 | |
| r | 3636 | 4.0% |
| f | 2839 | 3.1% |
| i | 2391 | 2.6% |
| t | 2252 | 2.5% |
| a | 1841 | 2.0% |
| s | 1654 | 1.8% |
| j | 1351 | 1.5% |
| d | 1325 | 1.5% |
| Other values (23) | 6052 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 90917 |
original_title
Text
| Distinct | 43373 |
|---|---|
| Distinct (%) | 95.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.3 KiB |
Length
| Max length | 109 |
|---|---|
| Median length | 84 |
| Mean length | 16.323494 |
| Min length | 1 |
Unique
| Unique | 41712 |
|---|---|
| Unique (%) | 91.7% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
| Value | Count | Frequency (%) |
| the | 10261 | 7.8% |
| of | 3309 | 2.5% |
| a | 1674 | 1.3% |
| in | 1275 | 1.0% |
| and | 1072 | 0.8% |
| la | 1007 | 0.8% |
| | 863 | 0.7% |
| to | 806 | 0.6% |
| de | 702 | 0.5% |
| man | 509 | 0.4% |
| Other values (35324) | 110301 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 86293 | 11.6% |
| e | 70665 | 9.5% |
| a | 49100 | 6.6% |
| o | 42066 | 5.7% |
| i | 39494 | 5.3% |
| n | 39149 | 5.3% |
| r | 37728 | 5.1% |
| t | 33530 | 4.5% |
| s | 28615 | 3.9% |
| l | 25557 | 3.4% |
| Other values (2936) | 289967 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 742164 |
popularity
Unsupported
Rejected  Unsupported 
| Missing | 5 |
|---|---|
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
release_date
Text
| Distinct | 17336 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 87 |
| Missing (%) | 0.2% |
| Memory size | 355.3 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9994491 |
| Min length | 1 |
Unique
| Unique | 8573 |
|---|---|
| Unique (%) | 18.9% |
Sample
| 1st row | 1995-10-30 |
|---|---|
| 2nd row | 1995-12-15 |
| 3rd row | 1995-12-22 |
| 4th row | 1995-12-22 |
| 5th row | 1995-02-10 |
| Value | Count | Frequency (%) |
| 2008-01-01 | 136 | 0.3% |
| 2009-01-01 | 121 | 0.3% |
| 2007-01-01 | 118 | 0.3% |
| 2005-01-01 | 111 | 0.2% |
| 2006-01-01 | 101 | 0.2% |
| 2002-01-01 | 96 | 0.2% |
| 2004-01-01 | 90 | 0.2% |
| 2001-01-01 | 84 | 0.2% |
| 2003-01-01 | 76 | 0.2% |
| 1997-01-01 | 69 | 0.2% |
| Other values (17326) | 44377 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 97600 | |
| - | 90752 | |
| 1 | 84056 | |
| 2 | 52806 | |
| 9 | 39773 | |
| 3 | 15435 | 3.4% |
| 8 | 15279 | 3.4% |
| 6 | 15021 | 3.3% |
| 5 | 14836 | 3.3% |
| 7 | 14289 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 453765 |
revenue
Real number (ℝ)
High correlation  Zeros 
| Distinct | 6863 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11209349 |
| Minimum | 0 |
| Maximum | 2.7879651 × 10^9 |
| Zeros | 38052 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 47808918 |
| Maximum | 2.7879651 × 10^9 |
| Range | 2.7879651 × 10^9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 64332247 |
|---|---|
| Coefficient of variation (CV) | 5.7391602 |
| Kurtosis | 237.51059 |
| Mean | 11209349 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.265983 |
| Sum | 5.0957698 × 10^11 |
| Variance | 4.138638 × 10^15 |
| Monotonicity | Not monotonic |
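The descriptive statistics for `revenue` are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of non-missing rows. The following is a minimal sketch that checks the printed values against those identities, assuming the zero share is taken over all rows. Inputs are the rounded figures from the tables, so small differences in the last digits are expected.

```python
# Values copied from the revenue tables (rounded as printed in the report).
n_rows = 45466
missing = 6
mean = 11209349
std = 64332247
zeros = 38052

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance is the squared standard deviation
total = mean * (n_rows - missing)  # sum over non-missing rows
zeros_pct = 100 * zeros / n_rows   # assumption: share of all rows

print(f"CV: {cv:.4f}")
print(f"Variance: {variance:.4e}")
print(f"Sum: {total:.4e}")
print(f"Zeros: {zeros_pct:.1f}%")
```

With 83.7% zeros, every quantile up to Q3 is 0, which is why the interquartile range and the median absolute deviation are both 0 while the mean is above 11 million.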
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 38052 | |
| 12000000 | 20 | < 0.1% |
| 10000000 | 19 | < 0.1% |
| 11000000 | 19 | < 0.1% |
| 2000000 | 18 | < 0.1% |
| 6000000 | 17 | < 0.1% |
| 5000000 | 14 | < 0.1% |
| 500000 | 13 | < 0.1% |
| 8000000 | 13 | < 0.1% |
| 1 | 12 | < 0.1% |
| Other values (6853) | 7263 | 16.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 38052 | |
| 1 | 12 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 9 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2787965087 | 1 | |
| 2068223624 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405403694 | 1 | |
| 1342000000 | 1 | |
| 1274219009 | 1 | |
| 1262886337 | 1 |
runtime
Real number (ℝ)
Zeros 
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 263 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.128199 |
| Minimum | 0 |
| Maximum | 1256 |
| Zeros | 1558 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 85 |
| median | 95 |
| Q3 | 107 |
| 95-th percentile | 138 |
| Maximum | 1256 |
| Range | 1256 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 38.40781 |
|---|---|
| Coefficient of variation (CV) | 0.40803724 |
| Kurtosis | 93.217158 |
| Mean | 94.128199 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 4.4659579 |
| Sum | 4254877 |
| Variance | 1475.1599 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 90 | 2556 | 5.6% |
| 0 | 1558 | 3.4% |
| 100 | 1470 | 3.2% |
| 95 | 1412 | 3.1% |
| 93 | 1214 | 2.7% |
| 96 | 1104 | 2.4% |
| 92 | 1080 | 2.4% |
| 94 | 1062 | 2.3% |
| 91 | 1057 | 2.3% |
| 88 | 1032 | 2.3% |
| Other values (343) | 31658 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1558 | |
| 1 | 107 | 0.2% |
| 2 | 33 | 0.1% |
| 3 | 48 | 0.1% |
| 4 | 51 | 0.1% |
| 5 | 51 | 0.1% |
| 6 | 72 | 0.2% |
| 7 | 103 | 0.2% |
| 8 | 78 | 0.2% |
| 9 | 63 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1256 | 1 | |
| 1140 | 2 | |
| 931 | 1 | |
| 925 | 1 | |
| 900 | 1 | |
| 877 | 1 | |
| 874 | 1 | |
| 840 | 2 | |
| 780 | 1 | |
| 720 | 1 |
spoken_languages
Text
| Distinct | 1931 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 765 |
|---|---|
| Median length | 40 |
| Mean length | 46.928289 |
| Min length | 2 |
Unique
| Unique | 1366 |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | [{'iso_639_1': 'en', 'name': 'English'}] |
|---|---|
| 2nd row | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'fr', 'name': 'Français'}] |
| 3rd row | [{'iso_639_1': 'en', 'name': 'English'}] |
| 4th row | [{'iso_639_1': 'en', 'name': 'English'}] |
| 5th row | [{'iso_639_1': 'en', 'name': 'English'}] |
| Value | Count | Frequency (%) |
| iso_639_1 | 53300 | |
| name | 53300 | |
| english | 28745 | |
| en | 28745 | |
| | 4809 | 2.2% |
| fr | 4196 | 1.9% |
| français | 4196 | 1.9% |
| deutsch | 2625 | 1.2% |
| de | 2625 | 1.2% |
| es | 2413 | 1.1% |
| Other values (203) | 33488 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 426400 | |
| (space) | 172982 | 8.1% |
| n | 120605 | 5.7% |
| _ | 106600 | 5.0% |
| : | 106600 | 5.0% |
| s | 99222 | 4.7% |
| i | 94120 | 4.4% |
| e | 92748 | 4.3% |
| a | 75235 | 3.5% |
| , | 64969 | 3.0% |
| Other values (174) | 773879 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2133360 |
status
Categorical
Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 87 |
| Missing (%) | 0.2% |
| Memory size | 355.3 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.0119218 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Released |
|---|---|
| 2nd row | Released |
| 3rd row | Released |
| 4th row | Released |
| 5th row | Released |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Released | 45014 | 99.0% |
| Rumored | 230 | 0.5% |
| Post Production | 98 | 0.2% |
| In Production | 20 | < 0.1% |
| Planned | 15 | < 0.1% |
| Canceled | 2 | < 0.1% |
| (Missing) | 87 | 0.2% |
Words
| Value | Count | Frequency (%) |
| released | 45014 | |
| rumored | 230 | 0.5% |
| production | 118 | 0.3% |
| post | 98 | 0.2% |
| in | 20 | < 0.1% |
| planned | 15 | < 0.1% |
| canceled | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 135291 | |
| d | 45379 | 12.5% |
| R | 45244 | 12.4% |
| s | 45112 | 12.4% |
| l | 45031 | 12.4% |
| a | 45031 | 12.4% |
| o | 564 | 0.2% |
| r | 348 | 0.1% |
| u | 348 | 0.1% |
| P | 231 | 0.1% |
| Other values (8) | 994 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 363573 |
title
Text
| Distinct | 42277 |
|---|---|
| Distinct (%) | 93.0% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Length
| Max length | 105 |
|---|---|
| Median length | 79 |
| Mean length | 16.708535 |
| Min length | 1 |
Unique
| Unique | 39947 |
|---|---|
| Unique (%) | 87.9% |
Sample
| 1st row | Toy Story |
|---|---|
| 2nd row | Jumanji |
| 3rd row | Grumpier Old Men |
| 4th row | Waiting to Exhale |
| 5th row | Father of the Bride Part II |
| Value | Count | Frequency (%) |
| the | 14571 | 10.7% |
| of | 4938 | 3.6% |
| a | 2244 | 1.6% |
| in | 1697 | 1.2% |
| and | 1634 | 1.2% |
| to | 1055 | 0.8% |
| | 763 | 0.6% |
| man | 665 | 0.5% |
| love | 664 | 0.5% |
| for | 602 | 0.4% |
| Other values (24431) | 107634 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 91029 | 12.0% |
| e | 76408 | 10.1% |
| a | 49056 | 6.5% |
| o | 45765 | 6.0% |
| n | 40931 | 5.4% |
| r | 40096 | 5.3% |
| i | 39859 | 5.2% |
| t | 36792 | 4.8% |
| s | 29591 | 3.9% |
| h | 28564 | 3.8% |
| Other values (277) | 281479 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 759570 |
video
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 355.3 KiB |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| False | 45367 | 99.8% |
| True | 93 | 0.2% |
| (Missing) | 6 | < 0.1% |
vote_average
Real number (ℝ)
Zeros 
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6182072 |
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 2998 |
| Zeros (%) | 6.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6.8 |
| 95-th percentile | 7.8 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 1.924216 |
|---|---|
| Coefficient of variation (CV) | 0.34249644 |
| Kurtosis | 2.5004022 |
| Mean | 5.6182072 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | -1.5189901 |
| Sum | 255403.7 |
| Variance | 3.7026072 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2998 | 6.6% |
| 6 | 2468 | 5.4% |
| 5 | 2001 | 4.4% |
| 7 | 1886 | 4.1% |
| 6.5 | 1722 | 3.8% |
| 6.3 | 1603 | 3.5% |
| 5.5 | 1381 | 3.0% |
| 5.8 | 1369 | 3.0% |
| 6.4 | 1350 | 3.0% |
| 6.7 | 1342 | 3.0% |
| Other values (82) | 27340 |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2998 | |
| 0.5 | 13 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 105 | 0.2% |
| 1.1 | 1 | < 0.1% |
| 1.2 | 4 | < 0.1% |
| 1.3 | 13 | < 0.1% |
| 1.4 | 5 | < 0.1% |
| 1.5 | 30 | 0.1% |
| 1.6 | 6 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 190 | |
| 9.8 | 1 | < 0.1% |
| 9.6 | 1 | < 0.1% |
| 9.5 | 18 | < 0.1% |
| 9.4 | 3 | < 0.1% |
| 9.3 | 18 | < 0.1% |
| 9.2 | 4 | < 0.1% |
| 9.1 | 3 | < 0.1% |
| 9 | 159 | |
| 8.9 | 7 | < 0.1% |
vote_count
Real number (ℝ)
High correlation  Zeros 
| Distinct | 1820 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109.89734 |
| Minimum | 0 |
| Maximum | 14075 |
| Zeros | 2899 |
| Zeros (%) | 6.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 355.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 10 |
| Q3 | 34 |
| 95-th percentile | 434 |
| Maximum | 14075 |
| Range | 14075 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 491.31037 |
|---|---|
| Coefficient of variation (CV) | 4.4706303 |
| Kurtosis | 151.2028 |
| Mean | 109.89734 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 10.450232 |
| Sum | 4995933 |
| Variance | 241385.88 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3264 | 7.2% |
| 2 | 3132 | 6.9% |
| 0 | 2899 | 6.4% |
| 3 | 2787 | 6.1% |
| 4 | 2480 | 5.5% |
| 5 | 2097 | 4.6% |
| 6 | 1747 | 3.8% |
| 7 | 1570 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Other values (1810) | 22931 | 50.4% |
| Value | Count | Frequency (%) |
| 0 | 2899 | 6.4% |
| 1 | 3264 | 7.2% |
| 2 | 3132 | 6.9% |
| 3 | 2787 | 6.1% |
| 4 | 2480 | 5.5% |
| 5 | 2097 | 4.6% |
| 6 | 1747 | 3.8% |
| 7 | 1570 | 3.5% |
| 8 | 1359 | 3.0% |
| 9 | 1194 | 2.6% |
| Value | Count | Frequency (%) |
| 14075 | 1 | < 0.1% |
| 12269 | 1 | < 0.1% |
| 12114 | 1 | < 0.1% |
| 12000 | 1 | < 0.1% |
| 11444 | 1 | < 0.1% |
| 11187 | 1 | < 0.1% |
| 10297 | 1 | < 0.1% |
| 10014 | 1 | < 0.1% |
| 9678 | 1 | < 0.1% |
| 9634 | 1 | < 0.1% |
Correlations
| | adult | revenue | runtime | status | video | vote_average | vote_count |
|---|---|---|---|---|---|---|---|
| adult | 1.000 | 0.000 | 0.000 | 0.074 | 0.000 | 0.019 | 0.000 |
| revenue | 0.000 | 1.000 | 0.254 | 0.000 | 0.000 | 0.127 | 0.513 |
| runtime | 0.000 | 0.254 | 1.000 | 0.000 | 0.059 | 0.194 | 0.291 |
| status | 0.074 | 0.000 | 0.000 | 1.000 | 0.000 | 0.019 | 0.000 |
| video | 0.000 | 0.000 | 0.059 | 0.000 | 1.000 | 0.047 | 0.000 |
| vote_average | 0.019 | 0.127 | 0.194 | 0.019 | 0.047 | 1.000 | 0.320 |
| vote_count | 0.000 | 0.513 | 0.291 | 0.000 | 0.000 | 0.320 | 1.000 |
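
The Overview flags revenue and vote_count as highly correlated. A minimal sketch of how such pairs could be picked out of the matrix above follows. The 0.5 cutoff and the helper name `high_correlation_pairs` are assumptions made for illustration, and the report does not state which correlation coefficient these values represent.

```python
from itertools import combinations

names = ["adult", "revenue", "runtime", "status", "video", "vote_average", "vote_count"]

# Matrix copied row by row from the Correlations table above.
matrix = [
    [1.000, 0.000, 0.000, 0.074, 0.000, 0.019, 0.000],
    [0.000, 1.000, 0.254, 0.000, 0.000, 0.127, 0.513],
    [0.000, 0.254, 1.000, 0.000, 0.059, 0.194, 0.291],
    [0.074, 0.000, 0.000, 1.000, 0.000, 0.019, 0.000],
    [0.000, 0.000, 0.059, 0.000, 1.000, 0.047, 0.000],
    [0.019, 0.127, 0.194, 0.019, 0.047, 1.000, 0.320],
    [0.000, 0.513, 0.291, 0.000, 0.000, 0.320, 1.000],
]


def high_correlation_pairs(names, matrix, threshold=0.5):
    """List each unordered pair whose absolute correlation meets the threshold.

    Assumption: threshold=0.5 is an illustrative cutoff, not a documented setting.
    """
    pairs = []
    # combinations() visits each pair once and skips the diagonal.
    for i, j in combinations(range(len(names)), 2):
        value = matrix[i][j]
        if abs(value) >= threshold:
            pairs.append((names[i], names[j], value))
    return pairs


flagged = high_correlation_pairs(names, matrix)
print(flagged)  # [('revenue', 'vote_count', 0.513)]
```

With this cutoff only the revenue and vote_count pair is returned, which is consistent with the two mirrored alerts in the Overview.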
Sample
| | adult | budget | genres | id | imdb_id | original_language | original_title | popularity | release_date | revenue | runtime | spoken_languages | status | title | video | vote_average | vote_count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | False | 30000000 | [{'id': 16, 'name': 'Animation'}, {'id': 35, 'name': 'Comedy'}, {'id': 10751, 'name': 'Family'}] | 862 | tt0114709 | en | Toy Story | 21.946943 | 1995-10-30 | 373554033.0 | 81.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Toy Story | False | 7.7 | 5415.0 |
| 1 | False | 65000000 | [{'id': 12, 'name': 'Adventure'}, {'id': 14, 'name': 'Fantasy'}, {'id': 10751, 'name': 'Family'}] | 8844 | tt0113497 | en | Jumanji | 17.015539 | 1995-12-15 | 262797249.0 | 104.0 | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'fr', 'name': 'Français'}] | Released | Jumanji | False | 6.9 | 2413.0 |
| 2 | False | 0 | [{'id': 10749, 'name': 'Romance'}, {'id': 35, 'name': 'Comedy'}] | 15602 | tt0113228 | en | Grumpier Old Men | 11.7129 | 1995-12-22 | 0.0 | 101.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Grumpier Old Men | False | 6.5 | 92.0 |
| 3 | False | 16000000 | [{'id': 35, 'name': 'Comedy'}, {'id': 18, 'name': 'Drama'}, {'id': 10749, 'name': 'Romance'}] | 31357 | tt0114885 | en | Waiting to Exhale | 3.859495 | 1995-12-22 | 81452156.0 | 127.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Waiting to Exhale | False | 6.1 | 34.0 |
| 4 | False | 0 | [{'id': 35, 'name': 'Comedy'}] | 11862 | tt0113041 | en | Father of the Bride Part II | 8.387519 | 1995-02-10 | 76578911.0 | 106.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Father of the Bride Part II | False | 5.7 | 173.0 |
| 5 | False | 60000000 | [{'id': 28, 'name': 'Action'}, {'id': 80, 'name': 'Crime'}, {'id': 18, 'name': 'Drama'}, {'id': 53, 'name': 'Thriller'}] | 949 | tt0113277 | en | Heat | 17.924927 | 1995-12-15 | 187436818.0 | 170.0 | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'es', 'name': 'Español'}] | Released | Heat | False | 7.7 | 1886.0 |
| 6 | False | 58000000 | [{'id': 35, 'name': 'Comedy'}, {'id': 10749, 'name': 'Romance'}] | 11860 | tt0114319 | en | Sabrina | 6.677277 | 1995-12-15 | 0.0 | 127.0 | [{'iso_639_1': 'fr', 'name': 'Français'}, {'iso_639_1': 'en', 'name': 'English'}] | Released | Sabrina | False | 6.2 | 141.0 |
| 7 | False | 0 | [{'id': 28, 'name': 'Action'}, {'id': 12, 'name': 'Adventure'}, {'id': 18, 'name': 'Drama'}, {'id': 10751, 'name': 'Family'}] | 45325 | tt0112302 | en | Tom and Huck | 2.561161 | 1995-12-22 | 0.0 | 97.0 | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'de', 'name': 'Deutsch'}] | Released | Tom and Huck | False | 5.4 | 45.0 |
| 8 | False | 35000000 | [{'id': 28, 'name': 'Action'}, {'id': 12, 'name': 'Adventure'}, {'id': 53, 'name': 'Thriller'}] | 9091 | tt0114576 | en | Sudden Death | 5.23158 | 1995-12-22 | 64350171.0 | 106.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Sudden Death | False | 5.5 | 174.0 |
| 9 | False | 58000000 | [{'id': 12, 'name': 'Adventure'}, {'id': 28, 'name': 'Action'}, {'id': 53, 'name': 'Thriller'}] | 710 | tt0113189 | en | GoldenEye | 14.686036 | 1995-11-16 | 352194034.0 | 130.0 | [{'iso_639_1': 'en', 'name': 'English'}, {'iso_639_1': 'ru', 'name': 'Pусский'}, {'iso_639_1': 'es', 'name': 'Español'}] | Released | GoldenEye | False | 6.6 | 1194.0 |
| | adult | budget | genres | id | imdb_id | original_language | original_title | popularity | release_date | revenue | runtime | spoken_languages | status | title | video | vote_average | vote_count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45456 | False | 0 | [{'id': 27, 'name': 'Horror'}, {'id': 9648, 'name': 'Mystery'}, {'id': 53, 'name': 'Thriller'}] | 84419 | tt0038621 | en | House of Horrors | 0.222814 | 1946-03-29 | 0.0 | 65.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | House of Horrors | False | 6.3 | 8.0 |
| 45457 | False | 0 | [{'id': 9648, 'name': 'Mystery'}, {'id': 27, 'name': 'Horror'}] | 390959 | tt0265736 | en | Shadow of the Blair Witch | 0.076061 | 2000-10-22 | 0.0 | 45.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Shadow of the Blair Witch | False | 7.0 | 2.0 |
| 45458 | False | 0 | [{'id': 27, 'name': 'Horror'}] | 289923 | tt0252966 | en | The Burkittsville 7 | 0.38645 | 2000-10-03 | 0.0 | 30.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | The Burkittsville 7 | False | 7.0 | 1.0 |
| 45459 | False | 0 | [{'id': 878, 'name': 'Science Fiction'}] | 222848 | tt0112613 | en | Caged Heat 3000 | 0.661558 | 1995-01-01 | 0.0 | 85.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Caged Heat 3000 | False | 3.5 | 1.0 |
| 45460 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 28, 'name': 'Action'}, {'id': 10749, 'name': 'Romance'}] | 30840 | tt0102797 | en | Robin Hood | 5.683753 | 1991-05-13 | 0.0 | 104.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Robin Hood | False | 5.7 | 26.0 |
| 45461 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 10751, 'name': 'Family'}] | 439050 | tt6209470 | fa | رگ خواب | 0.072051 | NaN | 0.0 | 90.0 | [{'iso_639_1': 'fa', 'name': 'فارسی'}] | Released | Subdue | False | 4.0 | 1.0 |
| 45462 | False | 0 | [{'id': 18, 'name': 'Drama'}] | 111109 | tt2028550 | tl | Siglo ng Pagluluwal | 0.178241 | 2011-11-17 | 0.0 | 360.0 | [{'iso_639_1': 'tl', 'name': ''}] | Released | Century of Birthing | False | 9.0 | 3.0 |
| 45463 | False | 0 | [{'id': 28, 'name': 'Action'}, {'id': 18, 'name': 'Drama'}, {'id': 53, 'name': 'Thriller'}] | 67758 | tt0303758 | en | Betrayal | 0.903007 | 2003-08-01 | 0.0 | 90.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Betrayal | False | 3.8 | 6.0 |
| 45464 | False | 0 | [] | 227506 | tt0008536 | en | Satana likuyushchiy | 0.003503 | 1917-10-21 | 0.0 | 87.0 | [] | Released | Satan Triumphant | False | 0.0 | 0.0 |
| 45465 | False | 0 | [] | 461257 | tt6980792 | en | Queerama | 0.163015 | 2017-06-09 | 0.0 | 75.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Queerama | False | 0.0 | 0.0 |
Duplicate rows
Most frequently occurring
| | adult | budget | genres | id | imdb_id | original_language | original_title | release_date | revenue | runtime | spoken_languages | status | title | video | vote_average | vote_count | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 15 | False | 0 | [{'id': 53, 'name': 'Thriller'}, {'id': 9648, 'name': 'Mystery'}] | 141971 | tt1180333 | fi | Blackout | 2008-12-26 | 0.0 | 108.0 | [{'iso_639_1': 'fi', 'name': 'suomi'}] | Released | Blackout | False | 6.7 | 3.0 | 3 |
| 0 | False | 0 | [{'id': 12, 'name': 'Adventure'}, {'id': 14, 'name': 'Fantasy'}, {'id': 16, 'name': 'Animation'}, {'id': 878, 'name': 'Science Fiction'}, {'id': 10751, 'name': 'Family'}] | 12600 | tt0287635 | ja | 劇場版ポケットモンスター セレビィ 時を越えた遭遇(であい) | 2001-07-06 | 28023563.0 | 75.0 | [{'iso_639_1': 'ja', 'name': '日本語'}] | Released | Pokémon 4Ever: Celebi - Voice of the Forest | False | 5.7 | 82.0 | 2 |
| 1 | False | 0 | [{'id': 12, 'name': 'Adventure'}, {'id': 16, 'name': 'Animation'}, {'id': 18, 'name': 'Drama'}, {'id': 28, 'name': 'Action'}, {'id': 10769, 'name': 'Foreign'}] | 23305 | tt0295682 | en | The Warrior | 2001-09-23 | 0.0 | 86.0 | [{'iso_639_1': 'hi', 'name': 'हिन्दी'}] | Released | The Warrior | False | 6.3 | 15.0 | 2 |
| 2 | False | 0 | [{'id': 14, 'name': 'Fantasy'}, {'id': 18, 'name': 'Drama'}, {'id': 878, 'name': 'Science Fiction'}] | 119916 | tt0080000 | en | The Tempest | 1980-02-27 | 0.0 | 123.0 | [] | Released | The Tempest | False | 0.0 | 0.0 | 2 |
| 3 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 10749, 'name': 'Romance'}] | 105045 | tt0111613 | de | Das Versprechen | 1995-02-16 | 0.0 | 115.0 | [{'iso_639_1': 'de', 'name': 'Deutsch'}] | Released | The Promise | False | 5.0 | 1.0 | 2 |
| 4 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 10769, 'name': 'Foreign'}] | 42495 | tt0067306 | en | King Lear | 1971-02-04 | 0.0 | 137.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Rumored | King Lear | False | 8.0 | 3.0 | 2 |
| 5 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 35, 'name': 'Comedy'}] | 168538 | tt0084387 | en | Nana | 1983-06-13 | 0.0 | 92.0 | [] | Released | Nana, the True Key of Pleasure | False | 4.7 | 3.0 | 2 |
| 6 | False | 0 | [{'id': 18, 'name': 'Drama'}, {'id': 878, 'name': 'Science Fiction'}, {'id': 16, 'name': 'Animation'}] | 152795 | tt1821641 | en | The Congress | 2013-05-16 | 455815.0 | 122.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | The Congress | False | 6.4 | 165.0 | 2 |
| 7 | False | 0 | [{'id': 18, 'name': 'Drama'}] | 109962 | tt0082992 | en | Rich and Famous | 1981-09-23 | 0.0 | 115.0 | [{'iso_639_1': 'en', 'name': 'English'}] | Released | Rich and Famous | False | 4.9 | 7.0 | 2 |
| 8 | False | 0 | [{'id': 18, 'name': 'Drama'}] | 132641 | tt0046468 | ja | Tsuma | 1953-04-29 | 0.0 | 89.0 | [{'iso_639_1': 'ja', 'name': '日本語'}] | Released | Wife | False | 0.0 | 0.0 | 2 |
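
The table above groups identical rows and counts how often each occurs. A minimal sketch of that grouping with the standard library follows. The rows are a hand-built miniature that reuses three columns (id, title, vote_count) from the tables in this report, not the real dataset, and `duplicate_groups` is a hypothetical helper.

```python
from collections import Counter

# Miniature example rows: (id, title, vote_count). Repeats are added by hand.
rows = [
    (141971, "Blackout", 3.0),
    (141971, "Blackout", 3.0),
    (141971, "Blackout", 3.0),
    (42495, "King Lear", 3.0),
    (42495, "King Lear", 3.0),
    (862, "Toy Story", 5415.0),
]


def duplicate_groups(rows):
    """Return (row, occurrences) for every row seen more than once,
    most frequent first."""
    counts = Counter(rows)
    groups = [(row, n) for row, n in counts.items() if n > 1]
    # sort() is stable, so ties keep their first-seen order.
    groups.sort(key=lambda item: item[1], reverse=True)
    return groups


groups = duplicate_groups(rows)
for row, n in groups:
    print(row, n)
```

On a pandas DataFrame, `DataFrame.duplicated(keep=False)` marks the same rows, provided every column holds hashable values.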